Adaptive Semantic-Spatio-Temporal Graph Convolutional Network for Lip Reading

نویسندگان

چکیده

The goal of this work is to recognize words, phrases, and sentences being spoken by a talking face without given the audio. Current deep learning approaches for lip reading focus on exploring appearance optical flow information videos. However, these methods do not fully exploit characteristics motion. In addition flow, mouth contour deformation usually conveys significant that complementary others. modeling dynamic has received little attention than flow. work, we propose novel model contours called Adaptive Semantic-Spatio-Temporal Graph Convolution Network (ASST-GCN), go beyond previous automatically both spatial temporal from To combine contour, two-stream visual front-end network proposed. Experimental results demonstrate proposed method significantly outperforms state-of-the-art several large-scale benchmarks.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spatio-temporal Graph Convolutional Neural Network: A Deep Learning Framework for Traffic Forecasting

The goal of traffic forecasting is to predict the future vital indicators (such as speed, volume and density) of the local traffic network in reasonable response time. Due to the dynamics and complexity of traffic network flow, typical simulation experiments and classic statistical methods cannot satisfy the requirements of mid-and-long term forecasting. In this work, we propose a novel deep le...

متن کامل

Spatio - Temporal Adaptive Interlaced

In this paper, we propose two adaptive interlaced-to-progressive conversion techniques in which the adequacy of the estimated motion vector is evaluated. If the motion vector is unlikely to give a good temporal motion compensated interpolation result, spatial interpolation is favored or selected to avoid temporal artifacts. In the rst proposed interlaced-to-progressive conversion technique, cal...

متن کامل

Convolutional Learning of Spatio-temporal Features

We address the problem of learning good features for understanding video data. We introduce a model that learns latent representations of image sequences from pairs of successive images. The convolutional architecture of our model allows it to scale to realistic image sizes whilst using a compact parametrization. In experiments on the NORB dataset, we show our model extracts latent “flow fields...

متن کامل

Topology adaptive graph convolutional networks

Convolution acts as a local feature extractor in convolutional neural networks (CNNs). However, the convolution operation is not applicable when the input data is supported on an irregular graph such as with social networks, citation networks, or knowledge graphs. This paper proposes the topology adaptive graph convolutional network (TAGCN), a novel graph convolutional network that generalizes ...

متن کامل

Adaptive Graph Convolutional Neural Networks

Graph Convolutional Neural Networks (Graph CNNs) are generalizations of classical CNNs to handle graph data such as molecular data, point could and social networks. Current filters in graph CNNs are built for fixed and shared graph structure. However, for most real data, the graph structures varies in both size and connectivity. The paper proposes a generalized and flexible graph CNN taking dat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Multimedia

سال: 2022

ISSN: ['1520-9210', '1941-0077']

DOI: https://doi.org/10.1109/tmm.2021.3102433